Transcription Methods for Consistency, Volume and Efficiency

نویسندگان

  • Meghan Lammie Glenn
  • Stephanie Strassel
  • Haejoong Lee
  • Kazuaki Maeda
  • Ramez Zakhary
  • Xuansong Li
چکیده

This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania to create manual transcripts as a shared resource for human language technology research and evaluation. Speech recognition and related technologies in particular call for substantial volumes of transcribed speech for use in system development, and for human gold standard references for evaluating performance over time. Over the past several years LDC has developed a number of transcription approaches to support the varied goals of speech technology evaluation programs in multiple languages and genres. We describe each transcription method in detail, and report on the results of a comparative analysis of transcriber consistency and efficiency, for two transcription methods in three languages and five genres. Our findings suggest that transcripts for planned speech are generally more consistent than those for spontaneous speech, and that careful transcription methods result in higher rates of agreement when compared to quick transcription methods. We conclude with a general discussion of factors contributing to transcription quality, efficiency and consistency.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficiency determination of HPGe detector by simulation and experimental methods for solid volume source

Analyzes of environmental samples regarding their radioactivity is of important concern for health purposes. We need standard sources to determine radioactive components and their activities. These sources are usually produced regarding type of the sample. One of the fundamental and precise tools to recognize radioactive materials and their activities is HPGe detector. To reach this goal, the d...

متن کامل

Comparison between single and double flow plane solar heaters considering gas radiation effect

ABSTRACT: In this paper, the thermal characteristics of single and double flow plane solar heaters with radiating working gas were analyzed and compared by numerical analysis for the first time. The laminar mixed convection gas flow in the heaters was numerically simulated by the CFD method using the finite volume technique. The set of governing equations included the conservation of mass, mome...

متن کامل

Anti-cancer effects of the combined treatment of trastuzumab and decoy oligodeoxynucleotides to target STAT3 transcription factor on SK-BR-3 breast cancer cell line

Introduction: Breast cancer is the most common malignancy in the female population and is the leading cause of death. Surgery, chemotherapy, radiotherapy, and monoclonal antibody (trastuzumab) therapy are common and standard treatments for this cancer. However, there are significant limitations in the treatment of this disease by using regular methods. Given the role of transcription factors (T...

متن کامل

Orthographic Transcription of the Spoken Dutch Corpus

This paper focuses on the specification of the orthographic transcription task in the Spoken Dutch Corpus, the problems encountered in making that specification and the evaluation experiments that were carried out to assess the transcription efficiency and the intertranscriber consistency. It is stated that the role of the orthographic transcriptions in the Spoken Dutch Corpus is twofold: on th...

متن کامل

"Technical Report" Flood Hydrograph Simulation Using HEC-HMS Model in Sarbaz River Basin of Sistan and Baluchestan Province

   Hydrological models are simplified representation of the real basin system, which helps to assess basin function in response to different inputs and better understanding of hydrological processes. The HEC-HMS model is one of the most important hydrological models for Flood estimating volumes and discharge in watersheds. In this research, the HEC-HMS hydrologic model was used to simulate the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010